Systematic Identi cation of Sca olds Representing Di erent Types of Structure-Activity Relationships

نویسندگان

  • Jürgen Bajorath
  • Michael Gütschow
چکیده

In medicinal chemistry, it is of central importance to understand structureactivity relationships (SARs) of small bioactive compounds. Typically, SARs are analyzed on a case-by-case basis for sets of compounds active against a given target. However, the increasing amount of compound activity data that is becoming available allows SARs to be explored on a large-scale. Moreover, molecular sca olds derived from bioactive compounds are also of high interest for SAR analysis. In general, sca olds are obtained by removing all substituents from rings and from linkers between rings. This thesis aims at systematically mining compounds for which activity annotations are available and investigating relationships between chemical structure and biological activities at the level of active compounds, in particular, molecular sca olds. Therefore, data mining approaches are designed to identify sca olds with di erent structural and/or activity characteristics. Initially, sca old distributions in compounds at di erent stages of pharmaceutical development are analyzed. Sets of sca olds that overlap between di erent stages or preferentially occur at certain stages are identi ed. Furthermore, a systematic selectivity pro le analysis of public domain active compounds is carried out. Sca olds that yield compounds selective for communities of closely related targets and represent compounds selective only for one particular target over others are identi ed. In addition, the degree of promiscuity of sca olds is thoroughly examined. Eighty-three sca olds covering 33 chemotypes correspond to compounds active against at least three di erent target families and thus are considered to be promiscuous. Moreover, by integrating pairwise scaffold similarity and compound potency di erences, the propensity of sca olds to form multi-target activity or selectivity cli s and, in addition, the global sca old potential of individual targets are quantitatively assessed, respectively. Finally, structural relationships between sca olds are systematically explored. Most sca olds extracted from active compounds are found to be involved in substructure relationships and/or share topological features with others. These substructure relationships are also compared to, and combined with, hierarchical substructure relationships to facilitate activity prediction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing the Reengineering Services for the DOK Federated Database System

This paper addresses the design of the reengineering service for federated databases. This service allows the hiding of the heterogeneity of databases involved in a federation by generating object-oriented representations from their corresponding schemata. We propose a complete methodology that supports the identi cation and the translation of both the explicit and implicit information. The ide...

متن کامل

Cognate Identi cation and Phylogenetic Inference : Search for a Better Past

Historical linguistics studies the relationships between languages as they change over time. Occasionally, speakers of a language will split into separate groups for any number of reasons and become isolated from each other. When this happens, the language they shared begins to diverge. These divergences are typically phonological, syntactic, and semantic in nature. These changes in each child ...

متن کامل

Lithofacies identi®cation using multiple adaptive resonance theory neural networks and group decision expert system

Lithofacies identi®cation supplies qualitative information about rocks. Lithofacies represent rock textures and are important components of hydrocarbon reservoir description. Traditional techniques of lithofacies identi®cation from core data are costly and di€erent geologists may provide di€erent interpretations. In this paper, we present a lowcost intelligent system consisting of three adaptiv...

متن کامل

Human Identi cation versus Expression Classi cation via Bagging on Facial Asymmetry

We demonstrate a dual usage of quanti ed facial asymmetry for (1) human identi cation under expression variations and (2) expression classi cation across di erent human subjects. Our experiments show the e ectiveness of using statistical bagging and feature subspace selection BEFORE applying classi ers such as Linear Discriminant Analysis. This preprocessing allows the same type but di erent di...

متن کامل

The FERET Veri cation

Two critical performance characterizations of biometric algorithms including face recognition are identi cation and veri cation In face recognition FERET is the de facto standard evaluation methodology Identi cation performance of face recog nition algorithms on the FERET tests has been previously reported In this paper we report on veri cation performance obtained from the Sep FERET test Re su...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011